Efficient non-uniform time-scaling of speech with WSOLA for CALL applications

نویسنده

  • M. Demol
چکیده

We consider the applicability of time-scaling for Computer Assisted Language Learning Applications (CALL) and present an efficient algorithm for non-uniform time-scaling. Formal listening tests show a general preference for this non-uniform time-scaling and indicate a dependence of this preference on such factors as the length of the utterance and the desired amount of time-scaling.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Overlap-add methods for time-scaling of speech

In this tutorial on time scaling we follow one particular line of thought towards computationally efficient high quality methods. We favor time scaling based on time-frequency representations over model based approaches, and proceed to review an iterative phase reconstruction method for time-scaled magnitude spectrograms. The search for a good initial phase estimate leads us to consider synchro...

متن کامل

Epoch-Synchronous Overlap-Add (ESOLA) for Time- and Pitch-Scale Modification of Speech Signals

Timeand pitch-scale modifications of speech signals find important applications in speech synthesis, playback systems, voice conversion, learning/hearing aids, etc.. There is a requirement for computationally efficient and real-time implementable algorithms. In this paper, we propose a high quality and computationally efficient timeand pitch-scaling methodology based on the glottal closure inst...

متن کامل

An Overlap-add Technique Based on Waveform Similarity (wsola) for High Quality Time-scale Modification of Speech

A concept of waveform similarity is proposed for tackling the problem of time-scale modification of speech, and is worked-out in the context of short-time Fourier transform representations. The resulting WSOLA algorithm produces high quality speech output, is algorithmically and computationally efficient and robust, and allows for on-line processing with arbitrary timescaling factors that may b...

متن کامل

CHAPTER 15 Time - Domain and Frequency - Domain Techniques for Prosodic Modification of Speech

1. Introductjon 2 General consjderatjons on tjrne-scaling and pjtch-scaling 2.1. Asjrnplemodelforvojcedspeech 2 Tjrne-scalernodificatjon 3 Pjtchl r ifi tj 4 ossjble approaches to prosodic modificatjon 3. The short tjrne Fourjer transforrn and overlap-add synthesjs 3.1. naly js 2 Modifi tjo . 3 Sy th sjs 4. im -scalingtechniques 4 OLAt m -scaling 4.2. y chroniz dOLA rne-scaling 3 WSOLA: An overl...

متن کامل

Waveform similarity based overlap-add (WSOLA) for time-scale modification of speech: structures and evaluation

A synchronization criterion for overlap-add time-scale modification is derived through a least squares estimation of the modified short-time Fourier transform. Based on this finding, a structural time-domain framework for time-scale modification is described. One efficient variant, which was called the Waveform Similarity based Overlap-Add (WSOLA) method, produces high quality output when appli...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2004